ENSM-SE at CLEF 2006: AdHoc Uses of Fuzzy Proximity Matching Function
نویسندگان
چکیده
Starting from the idea that the closer the query terms in a document are to each other the more relevant the document, we propose an information retrieval method that uses the degree of fuzzy proximity of key terms in a document to compute the relevance of the document to the query. Our model handles Boolean queries but, contrary to the traditional extensions of the basic Boolean information retrieval model, does not use a proximity operator explicitly. A single parameter makes it possible to control the proximity degree required. To improve our system we use a stemming algorithm before indexing, we take a specific influence function and we merge fuzzy proximity result list built with different spread of influence function. We explain how we construct the queries and report the results of our experiments in the ad-hoc monolingual French task of the CLEF 2006 evaluation campaign.
منابع مشابه
ENSM-SE at CLEF 2005: Uses of Fuzzy Proximity Matching Function
Based on the idea that the closer the query terms in a document are, the more relevant this document is, we propose a information retrieval method based on a fuzzy proximity degree of term occurences to compute document relevance to a query. Our model is able to deal with Boolean queries, but contrary to the traditional extensions of the basic Boolean information retrieval model, it does not ex...
متن کاملAmharic-English Information Retrieval
We describe Amharic-English cross lingual information retrieval experiments in the adhoc bilingual tracs of the CLEF 2006. The query analysis is supported by morphological analysis and part of speech tagging while we used different machine readable dictionaries for term lookup in the translation process. Out of dictionary terms were handled using fuzzy matching and Lucene[4] was used for indexi...
متن کاملENSM-SE and UJM at INEX 2010: Scoring with Proximity and Tag Weights
This paper presents our participation in the Relevant in Context task (ad-hoc track) during the 2010 INEX competition, and a posterior analysis. Two models presented in previous editions of INEX by the authors were merged for our 2010 participation. The first one is based on the proximity of the query terms in the documents [1] and the second one is based on learnt tag weights [2]. The results ...
متن کاملFuzzy Term Proximity With Boolean Queries at 2006 TREC Terabyte Task
We report here the results of fuzzy term proximity method applied to Terabyte Task. Fuzzy proxmity main feature is based on the idea that the closer the query terms are in a document, the more relevant this document is. With this principle, we have a high precision method so we complete by these obtained with Zettair search engine default method (dirichlet). Our model is able to deal with Boole...
متن کاملExploiting Semantic Features for Image Retrieval at CLEF 2005
This paper presents the MIRACLE’s team approach to text-based image retrieval at ImageCLEF 2005 adhoc task. The experiments defined this year try to use semantic information sources, like semantic dictionaries or text structure. For this purpose EuroWordnet has been considered and a new algorithm to extract synonyms from the semantic database has been developed. This new algorithm implementatio...
متن کامل